Basic Statistics

Raw Counts

Name Value
Rows 28,698
Columns 46
Discrete columns 24
Continuous columns 22
All missing columns 0
Missing observations 0
Complete Rows 28,698
Total observations 1,320,108
Memory allocation 7.9 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (by frequency)

## 8 columns ignored with more than 50 categories.
## ac_title: 27323 categories
## open_dt: 221 categories
## scheme_cd: 64 categories
## bsr_activity_cd: 230 categories
## industry_name: 217 categories
## branch_name: 3091 categories
## district_name: 452 categories
## Date.of.Birth: 9269 categories

QQ Plot

Correlation Analysis

## 11 features with more than 20 categories ignored!
## ac_title: 27323 categories
## open_dt: 221 categories
## scheme_cd: 64 categories
## bsr_activity_cd: 230 categories
## industry_name: 217 categories
## bsr_org_cd: 25 categories
## branch_name: 3091 categories
## district_name: 452 categories
## region_name: 48 categories
## state_name: 37 categories
## Date.of.Birth: 9269 categories
## Warning in cor(x = structure(list(ConsumerID = c(425804L, 427555L, 430231L, : the standard deviation is zero

Principal Component Analysis

## 8 features with more than 50 categories ignored!
## ac_title: 27323 categories
## open_dt: 221 categories
## scheme_cd: 64 categories
## bsr_activity_cd: 230 categories
## industry_name: 217 categories
## branch_name: 3091 categories
## district_name: 452 categories
## Date.of.Birth: 9269 categories
## Warning in plot_prcomp(data = structure(list(ConsumerID = c(425804L, 427555L, : The following features are dropped due to zero variance:
##  * MobileNo_Avl_Flag_1